Automated chart review utilizing natural language processing algorithm for asthma predictive index
نویسندگان
چکیده
BACKGROUND Thus far, no algorithms have been developed to automatically extract patients who meet Asthma Predictive Index (API) criteria from the Electronic health records (EHR) yet. Our objective is to develop and validate a natural language processing (NLP) algorithm to identify patients that meet API criteria. METHODS This is a cross-sectional study nested in a birth cohort study in Olmsted County, MN. Asthma status ascertained by manual chart review based on API criteria served as gold standard. NLP-API was developed on a training cohort (n = 87) and validated on a test cohort (n = 427). Criterion validity was measured by sensitivity, specificity, positive predictive value and negative predictive value of the NLP algorithm against manual chart review for asthma status. Construct validity was determined by associations of asthma status defined by NLP-API with known risk factors for asthma. RESULTS Among the eligible 427 subjects of the test cohort, 48% were males and 74% were White. Median age was 5.3 years (interquartile range 3.6-6.8). 35 (8%) had a history of asthma by NLP-API vs. 36 (8%) by abstractor with 31 by both approaches. NLP-API predicted asthma status with sensitivity 86%, specificity 98%, positive predictive value 88%, negative predictive value 98%. Asthma status by both NLP and manual chart review were significantly associated with the known asthma risk factors, such as history of allergic rhinitis, eczema, family history of asthma, and maternal history of smoking during pregnancy (p value < 0.05). Maternal smoking [odds ratio: 4.4, 95% confidence interval 1.8-10.7] was associated with asthma status determined by NLP-API and abstractor, and the effect sizes were similar between the reviews with 4.4 vs 4.2 respectively. CONCLUSION NLP-API was able to ascertain asthma status in children mining from EHR and has a potential to enhance asthma care and research through population management and large-scale studies when identifying children who meet API criteria.
منابع مشابه
IDENTIFICATION OF ADVERSE EVENTS RELATED TO CENTRAL VENOUS CATHETERS USING TWO SEMIAUTOMATED METHODS TO REVIEW FREE TEXT COMPUTERIZED MEDICAL RECORDS by
Methods for surveillance of adverse events in clinical settings are limited by cost, technology, and appropriate data availability. Manual chart review is the gold standard methodology against which any new technology must be compared at this time. In this study, two methods for semiautomated review of text records within the Veterans Administration database are utilized to identify adverse eve...
متن کاملResearch Paper: Automated Detection of Adverse Events Using Natural Language Processing of Discharge Summaries
OBJECTIVE To determine whether natural language processing (NLP) can effectively detect adverse events defined in the New York Patient Occurrence Reporting and Tracking System (NYPORTS) using discharge summaries. DESIGN An adverse event detection system for discharge summaries using the NLP system MedLEE was constructed to identify 45 NYPORTS event types. The system was first applied to a ran...
متن کاملThe Super Annotator: A Method of Semi-Automated Rare Event Identification for Large Clinical Data Sets
Detecting rare events in an electronic medical record (EMR) is similar to searching for a needle in a haystack. Automated methods (natural language processing) and manual methods (chart review) can be used. However, when events are rare and the patient population is large, manual annotation may not be feasible and NLP alone may not be able to reach acceptable levels of performance. We propose a...
متن کاملImportance of multi-modal approaches to effectively identify cataract cases from electronic health records
OBJECTIVE There is increasing interest in using electronic health records (EHRs) to identify subjects for genomic association studies, due in part to the availability of large amounts of clinical data and the expected cost efficiencies of subject identification. We describe the construction and validation of an EHR-based algorithm to identify subjects with age-related cataracts. MATERIALS AND...
متن کاملIncreasing the efficiency of trial-patient matching: automated clinical trial eligibility Pre-screening for pediatric oncology patients
BACKGROUND Manual eligibility screening (ES) for a clinical trial typically requires a labor-intensive review of patient records that utilizes many resources. Leveraging state-of-the-art natural language processing (NLP) and information extraction (IE) technologies, we sought to improve the efficiency of physician decision-making in clinical trial enrollment. In order to markedly reduce the poo...
متن کامل